Frequency Domain-Based Detection of Generated Audio
نویسندگان
چکیده
Attackers may manipulate audio with the intent of presenting falsified reports, changing an opinion a public figure, and winning influence power. The prevalence inauthentic multimedia continues to rise, so it is imperative develop set tools that determines legitimacy media. We present method analyzes signals determine whether they contain real human voices or fake (i.e., generated by neural acoustic waveform models). Instead analyzing directly, proposed approach converts into spectrogram images displaying frequency, intensity, temporal content evaluates them Convolutional Neural Network (CNN). Trained on both genuine voice synthesized signals, we show our achieves high accuracy this classification task.
منابع مشابه
Newborn EEG Seizure Detection Based on Interspike Space Distribution in the Time-Frequency Domain
This paper presents a new time-frequency based EEG seizure detection method. This method uses the distribution of interspike intervals as a criterion for discriminating between seizure and nonseizure activities. To detect spikes in the EEG, the signal is mapped into the time-frequency domain. The high instantaneous energy of spikes is reflected as a localized energy in time-frequency domain. Hi...
متن کاملWide-Band Audio Coding Based on Frequency-Domain Linear Prediction
In this paper, we re-visit an original concept of speech coding in which the signal is separated into the carrier modulated by the signal envelope. A recently developed technique, called frequency domain linear prediction (FDLP), is applied for the efficient estimation of the envelope. The processing in the temporal domain allows for a straightforward emulation of the forward temporal masking. ...
متن کاملProgress in LPC-based frequency-domain audio coding
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http://creativecommons.org/ licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is unaltered and is properly cited. The written permission of Cambridge University Press must be obta...
متن کاملRobust Audio Watermarks in Frequency Domain
In this paper an audio watermarking technique is presented, using log-spectrum, dirty paper codes and LDPC for watermark embedding. This technique may be used as a digital communication channel, transmitting data at about 40 b/s. It may be also applied for hiding a digital signature, e.g., for copyright protection purposes. Robustness of the watermarks against audio signal compression, resampli...
متن کاملDrum Detection from Polyphonic Audio via Detailed Analysis of the Time Frequency Domain
This publication presents a method for the automatic detection and classification of three distinct drum instruments in real world musical signals. The regarded instruments are kick, snare and hi-hat as agreed by the participants of the contest category Audio Drum Detection within the 2nd Annual Music Information Retrieval Evaluation eXchange (MIREX 2005). There are two challenging issues inher...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IS&T International Symposium on Electronic Imaging Science and Technology
سال: 2021
ISSN: ['2470-1173']
DOI: https://doi.org/10.2352/issn.2470-1173.2021.4.mwsf-273